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Abstract 



^*^ In this paper an analytic expression is given for the bounds of the dis- 

O , tribution function of the sum of dependent normally distributed random 

variables. Using the theory of copulas and the important Prechet bounds 
the dependence structure is not restricted to any specific type. Numerical 



C^ illustrations are provided to assess the quality of the derived bounds. 

B 

1 Introduction 
(N 

^ Many problems in mathematical probability theory involve the computation of 

^" the distribution function for the sum of random variables (RVs). In case of 

_j. independent RVs X and Y the distribution Fz of the sum Z = X + Y is given 

T^ by the integral 

O Fz{t)^P{X + Y<t)^ I ( I f{z~y)g{y)dy]dz (1) 



. . with / and g as the corresponding density functions of X and Y respectively. 
^ ^ If the RVs however are stochastically dependent, which is a common situation 

^^ in practice, the convolution of the marginal densities / and g in the integrand 

^ of (fTl) is no longer valid. Under the assumption that the joint density function 

^ fxY of X and Y is specified, the distribution Fz can be calculated by 

Fz{t) = P{X + Y<t)= I (JfxY{z-y,y)dy]dz (2) 

An analytical exact solution of the integral both in (II]) and (pi) is feasible for 
certain marginals /, g or joint densities fxY (e.g. the normal one). In situations 
where /, g or fxv is too cumbersome to work with, one could use Monte-Carlo 
simulation for numerical solution of the integrals. 
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In many circumstances, however, the joint density of the RVs X and Y is 
not known, despite of given marginal distributions. This leads to the question, 
whether it is possible to provide bounds for the distribution of the sum, which 
are valid for all possible joint distributions. The original problem was formulated 
by A. N. Kolmogorov: Let X and Y be RVs with given distributions Fx and 
Fy. Find bounds G^ (upper bound) and G^ (lower bound) for the distribution 
G of the sum Z = X + Y, such that 



G''{z)^miP{X + Y <z) (3) 

G^ (z) = sup P{X + Y <z) (4) 



where the infimum and suprcmum are taken over all possible joint distributions 
having the marginal distributions Fx , Fy ■ In this situation, it is said, that the 
joint distribution has fixed margins. 

By now a rich literature is available on this subject. G.D. Makarov [T] solved 
Kolmogorov's problem via a cumbersome, ad hoc argument. Other authors, e.g. 
Frank et al. [5] have applied theory orginally studied by Frechet which leads 
to copulas naturally. The authors in [5], [6] provide examples of distributions 
for which the bounds G^ and G^ can be explicitly computed. These distribu- 
tions include the uniform, the Cauchy and the exponential families. For further 
reading the interested reader is referred to [7] - [TU] . 

In this contribution we compute explicit bounds for normally distributed 
RVs by means of the copula based theory. Compared to the work of Frank 
et al. [2] who addressed this problem 1987 by using the method of Langrange 
multipliers we will use a simple variable substitution. 

The rest of the paper is organized as follows. In Section 2 we give a brief 
outline of the copula theory and present the general theorem for bounding the 
distribution of the sum of RVs with unknown dependence. Then in Section 3 
lower and upper bounds for the distribution of the sum of normally distributed 
RVs are presented. Finally we illustrate these bounds by numerical examples in 
Section 4. 



2 Preliminaries 

2.1 Copulas 

Copulas are used in probability theory and statistics for modeling the depen- 
dence between RVs Xi,X2,-.-,Xd. at a deeper level allowing to understand de- 
pendence measures different from a simple correlation coefficient approach. For 
any natural d, a (d-dimensional) copula is a distribution function on [0, 1]'' with 
standard uniform margins. 

For the formal definition of a copula as well as a general introduction into 
the wide fields of copulas the reader is referred to [3], [1]. In this paper the 
following theorems are of particular importance. 
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Theorem 1 (Sklar's Theorem) Let H be a joint distribution function for the 
RVs X and Y with marginals F and G. Then there exists a copula C such that 
for all x, J/ e M; 

H{x,y)^C{F{x),G{y)) (5) 

Theorem 2 (Prechet Bounds) Let {u, v) e P = ([0, 1], [0, 1]) C M^, then for 
every copula C and every {u,v) G I^ the Frechet-bounds inequality is valid: 

W{u, v) < C{u, v) < M{u, v) (6) 

with M{u,v) = rain{u,v) and W{u,v) = max(u + u — 1,0) as the Frechet 
bounds. 

The Frechet bounds incquahty together with Sklar's Theorem leads to the 
Frechet bounds for the joint distribution function H , 

max{F{x) + G{y) - 1, 0) < H{x, y) < min{F{x),G{y)) (7) 

so the joint distribution H is bounded in terms of its own marginals. The proofs 
of these theorems as well as ([t]) can be found in [3] . 

2.2 Bounding the sum of RVs with unknown dependence 

The problem of calculating the distribution G of the sum Z — X + Y , where the 
dependencer] between X and Y is unknown, is called the Kolmogorov problem. 
Using the theory of copulas the following theorem and its proof are given in [3] . 



Theorem 3 Let X and Y be RVs with distribution functions Fx and Fy ■ Let 

G denote the distribution function of X + Y. Then 

G^(z)<G(^)<G^(^) (8) 

where 

G''iz)= snp {WiFxix),FY{y))} (9) 

G^{z)= inf {W{Fx{x),FY{y))} (10) 

with W{u^ v) ^ u ^ V — W{u^ v) — rain(u + v^ 1) and W{u^ v) as in Theorem 2. 

In the following G^ is named as upper bound and G^ as lower bound for 
the distribution G. 



A note on terminology: The term "dependence" will be used for a common measure in the 
study of the dependence betweeen RVs and therefore is not restricted to a measure of the linear 
dependence between RVs. 
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3 Bounds for the sum of normally distributed 
RVs 

Let X and Y be normally distributed with means fix , fJ-Y and standard devia- 
tions ax,<JYi denoted by X '--^ 7V(/^XjO'x)i^ ^ -^(mvi'^i')- Their distribution 
functions Fx and Fy are given by 

Fx{.) = <^('^^^) and Fyiy)^<,^y-''- 



(Tx J \ O-y 

where $ denotes the standard normal distribution 

1 /■' =2 

$(t) = -^= / e~^ds 
v Stt J-oo 



In case of ax 7^ fY the lower and upper bound for the distribution G of the 
sum Z = X + Y are given by the following proposition. 

Proposition 1 The lower bound G^ and the upper bound G^ for G are calcu- 
lated by 

V. X _ f max(*(a;i)-l,0): *(xi) > *(a;2) (,.. 

^ "^^ ^ \ max(*(a;2)-l,0) : ^(a;!) < *(a;2) ^ ' 



and 



^ I min(^(a;i),l): *(a;i) < ^(xa) .^2) 

^' 1 min(^(a;2),l) : *(a;i) > *(a;2) ^ ' 



with the function ^ : M — > M; 

and Xi,a;2 as the local extremas of ^! . 

Proof. Using Theorem 3 together with the Frechet bounds and ^ introduced 
in section 2, G"^ and G^ are given by 

G^(z) = sup max($f^^^V*f^^^^) -l.Or' (14) 

x+y=z V \ (^X J \ (TY 

G^(.) = 4n£^min{$(^)+cl>(^),l^ (15) 



For the sum of the distributions Fx and Fy in (14 1, (15) we introduce the 
function * : M ^ M: 

where the variable y has been substituted by z — x. 
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Now, we can proceed with the computation of the local extremas of ^ as 
function of one variable. Therefore the first derivative ^'{x) is built: 



*'(x) = 



1 



(3=-Mx) 



1 



(x-Cz-f,,,))^ 



Finding the zeros of ^'(x) leads to 



cry v27r 



1 



1 






(17) 



(18) 



crj!fv27r (Tyv27r 

which - after some technical calculation - is equivalent to the quadratic equation 



ax + /3x + 7 = 



with the variables a,/3,7 as follows: 

1 1 



P 



7 = 



2cr| 2crf, 
^2 ^2 



^Y 



(7 



X 



-(Z-^1')2 ^2^ 



24 



2-i 



-In 






The solutions of (19 1 are given by a;i,2 



a;i,2 



-13 ± ^/32 - 4a7 
2a 



(19) 

(20) 
(21) 

(22) 
(23) 



which are the candidates for the local extremas of the function ^(x). 



As (Jx 7^ cry (a 7^ 0) it is guaranteed that the division in (23 1 is defined. 
Moreover, any possible values for /ix, ^y i ctx, cry will lead to two solutions xi ^ 



X2 of the quadratic equation (19). In order to show this, we compute the limits 
of '5 for X — > ±00. 



/ 



lim ^(x) = lini 



a;— >- — 00 



a:— >- — CX3 



$ 



X- nx 



$ 



Z ~ X — fly 

ay 



\ 



= 1 



(24) 



lim '^{x) — lim 



$ 



X- fJ-X 

<yx 



$ 



Z — X — fiy 
cry 



= 1 



(25) 



7 



The same limits of ^ for x — > ±00 together with the fact that ^ is continuous 
but not constant on M implies that ^ has one extremum at least. If ^ had only 
one extremum there would be exactly one solution xi = 0:2 (zero of order 2) of 
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(19). A zero of order 2 for "^'(x) = however cannot be a extreme value. So, 
in any case there exist exactly two extreme values. 

Now we consider the set U :— {^(xi) — 1,^(2:2) — 1,0}, which contains 
both a minimum and maximum value. If ^(xi) > ^(2:2) then max([/) = 
max(^(a;i), 0), otherwise max(f/) = max(\E'(a;2),0). As for each set M having 
a maximum value there is supM = maxM, equation ( |Tl| ) of Proposition 1 
is shown. Similarly equation ( |12[ ) of Proposition 1 can be proven using the 
set V := {*(a;i),*(a;2),l}. If *(a;i) < *(a;2) then min(C/) = min (*(2;i), 1), 
otherwise min(C/) = min {'i>{x2), 1). □ 

In the special case, that the RVs have the same standard deviation {ax = 
ay) the bounds G^ and G^ can be expressed in closed form by the following 
corollary. 

Corollary 1 Let ax = ay = a the bounds G^ and G^ for the distribution G 
of the sum of two normally distributed random variables are given by 



and 

1, z> PLX+ ^J^.y 

Proof. By using the same arguments as in the proof of Proposition 1, we get 



G-(.).|f('"'^^r^)' -5^- + ^>- (27) 



a = from (20). This means we have to solve a linear equation jSx + 7 = 0. 



From (21) and (22 1 we get 



z - /iy fix _ z - fiy - ^ix ,„„. 

2 2" ^ 2 V"^"/ 

a'' a'' a'' 

2f72 ^ 2a2 2a2 



' o.,2 + 9^2 0^2 V^^'' 



Then the solution xq for /?a; + 7 = is given by 

_ 7 _ z^ - 2z^Y + nl. - fij^ _ z - ^y + nx /„„^ 

"^"""^^ 2(z-^Y-Mx) " 2 ^^°^ 

As xq is a zero of order 1 of ^'(a;) = it is guaranteed, that xq is indeed a 
extreme value. Inserting xq into ^(x) leads to 



*K) 




(31) 



4 ILLUSTRATIVE EXAMPLES 



li z > fix + fJ'Y then the argument of the distribution function $ in ( |3l[ ) is 
positive. As <i>(i) > 0.5 for any positive value t g K we have \E'(a;o) > 1. 
Conversely, if z < ^x + fJ-Y then the argument of the distribution function $ in 



(31) is negative and as a result ^(xq) < 1. Together with the equations for G^ 
and G^ in (14), (15) the corollary is proven. D 



Remark 1 The result of corollary 1 is in accordance with the result of M. J. 
Frank 121. However the bounds from Frank in the common case for different 
standard deviations (ax 7^ ay) cannot be reproduced. Simple numerical exam- 
ples have revealed, that Frank's bounds have also negative values, which is in 
contradiction to a distribution function. 

4 Illustrative Examples 

In this section we want to illustrate the Proposition 1 in section 3 by a simple 
example with the following prerequisites: 

• The RVs X and Y are normally distributed, where the parameters /i and 
a are chosen as follows: X ~ iV(l,0.1), Y - 7V(1.5, 0.15). 



• 



For modeling the dependence amongst X and Y two different strategies 
are used: I) X,Y are bivariate normal distributed with a given correlation 
coefficient p. 2) The dependence of X and Y is modeled using either the 
Clayton (C^^) or the Gumbel {C^") Copula g]. 

Cf^{ui,U2) = exp(^-{{-ln{m)f + {-ln{u2))')"^) 

These copulas have been chosen, because the Gumbel (Clayton) copula 
turns out to have upper (lower) tail dependence. A method for generating 
realizations from a particular copula within Matlab is described in |11] . 
The value 9 has been set to 2.5 for both copulas. 



• 



The bound computation G^ and G^ as well as generating dependent ran- 
dom variables using the concept of copulas is done via Matlab. 

In figure [l] the bounds G^ , G^ are compared to the sum distribution F 
assuming X, Y are bivariate normally distributed. The corner cases p = and 
p = 1 have been used for the bivariate normal distribution. 

In figure [2] the bounds G^ , G^ are compared to the sum distribution F 
assuming that X, Y are dependent either by Clayton or Gumbel copula. 
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